Robust Features and Environmental Compensation: a Few Comments 2. Features and Models

نویسنده

  • Nelson Morgan
چکیده

1. GENERAL COMMENTS This is a brief note to comment on a few points related to two excellent keynote papers by Greenberg 3] and by Stern et al 5]. In a sense, Stern's paper describes the current technology; in particular, approaches to adjusting ASR systems based on phone or sub-phone-based HMMs in order to improve performance in the presence of noise and linear channel eeects. On the other hand, Greenberg's paper gives a direction for the future, focus-ing on aspects of spoken language that he does not believe our current systems incorporate. At rst glance, the papers might seem almost unrelated. Greenberg's paper focuses on characteristics of conversational speech that indicate limitations of current ASR technology. He suggests a wide-ranging multi-tiered strategy as the fundamental solution to the poor performance that is observed for unexpected testing conditions with machine recogniz-ers. Stern's paper is descriptive of the approaches to noise and channel robustness developed at CMU and elsewhere over the last decade, and as such is a good review of what can be done with the techniques that Greenberg criticizes. The papers are not really contradictory; faced with the requirement of improving recognition performance a good engineer will both consider new directions and also maximally exploit the existing ones. The CMU group has placed considerable emphasis on exploiting a range of solutions to linear disturbances, including both model-based and feature-based compensations. When information about the nature of the disturbance (or about the \clean" signal) is available, methods pioneered by the CMU group show the extent to which the problem can be reduced. Other methods show how iterative approaches (EM) can be used to improve the probability estimates despite interfering signals or convolutional error. We do not yet know what engineering techniques will be required in order to implement a system incorporating all the levels that Greenberg suggests, but when we do it is likely that a real implementation will be statistical, and as such will still require mathematical characterizations such as the ones Stern presents (though perhaps not these same ones). Stern's taxonomy of compensation strategies consists of three classes of approaches: feature modiication to match an undegraded signal (which he calls empirical, and which will not be discussed further here); model-based compensation , in which statistical model parameters are modiied during testing; and what he refers to as cepstral high-pass ltering, which will be discuss further in the next section. …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...

متن کامل

Environmental tranquility: A conceptual framework and urban ‎architectural features

Stressful life and reduced well-being have always been an issue of lifestyle in modern society. Constructing a multidisciplinary conceptual framework of environmental tranquility and quality of life is required for the field of architectural development, improved environmental quality, and enhanced human well-being. This paper reviews the main concepts of tranquility, environmental quality, and...

متن کامل

Radiomics modelling of IMRT induced acute rectal toxicity using clinical and magnetic resonance imaging features

Introduction: Rectal toxicity is a dose limiting issue in prostate cancer radiotherapy. Prediction of these effects may be used to tailor the therapy. The purpose of this work was to develop predictive radiomic models based on clinical, dosimetric and radiomic features extracted from rectal wall magnetic resonance image (MRI).   Materials and Methods: This st...

متن کامل

Offline Language-free Writer Identification based on Speeded-up Robust Features

This article proposes offline language-free writer identification based on speeded-up robust features (SURF), goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of word region and the corresponding scales and orientations (SOs) are extr...

متن کامل

DEM-based analysis of morphometric features in humid and hyper-arid environments using artificial neural network

Abstract This paper presents a robust approach using artificial neural networks in the form of a Self Organizing Map (SOM) as a semi-automatic method for analysis and identification of morphometric features in two completely different environments, the Man and Biosphere Reserve “Eastern Carpathians” (Central Europe) in a complex mountainous humid area and Yardangs in Lut Desert, Iran, a hyper...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997